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I.  INTRODUCTION 

2 

The  x goodness  of  fit  test  is  by  far  the  most  widely  used  statisti- 
cal technique  for  investigating  the  "adequacy"  of  a probability  model  in 
describing  an  observable  random  phenomenon.  The  procedure  has  been  sub- 
jected to  a considerable  amount  of  quite  justifiable  criticism.  Among  the 
arguments  against  the  procedure  is  the  simple  observation  that  no  proba- 
bility model  constitutes  an  exact  description  of  a stochastic  process, 
and  thus  the  null  hypothesis  that  a probability  model  describes  a process 
is  known  a priori  to  be  false.  Moreover,  given  a sufficiently  large 
sample,  any  null  hypothesis  that  a fixed  probability  model  holds  will  be 
rejected  with  high  probability.  Thus,  for  large  samples,  departures  from 
a probability  model  that  are  of  no  practical  significance  may  result  in 
a strong  rejection  of  the  hypothesis  that  this  probability  model  describes 

the  random  process.  Be  that  as  it  may,  there  remains  an  anomaly  of  even 

2 

greater  concern.  The  most  common  use  of  the  x goodness  of  fit  test  is 


3 

the  validation  rather  than  the  rejection  of  a given  family  of  probability 

distributions.  In  other  words,  it  is  comnon  practice  to  interpret  the 

acceptance  of  a null  hypothesis  as  a validation  of  a probability  model, 

noting  that  such  acceptance  has  shown  that  the  data  is  not  inconsistent 

with  the  model  being  entertained.  It  is  a giant  step  indeed  to  the 

analysis  that  follows,  in  which  the  probability  model  is  tacitly  assumed 

to  hold.  The  Information  necessary  to  justify  this  step  is  rarely  available. 

The  crucial  information  consists  of  a thorough  knowledge  of  the  power  of 
2 

the  x test  at  a fairly  broad  set  of  reasonable  alternatives  to  the  null 

model.  In  the  absence  of  such  information,  the  experimenter  remains 

unsure  about  the  proper  interpretation  of  acceptance,  and  may  be  led  to 

accept  a model  only  because  the  test  procedure  employed  is  unable  to  detect 

certain  types  of  departures  from  this  model. 

2 

Another  peculiarity  of  the  x goodness  of  fit  test  is  the  fact  that 

a variety  of  substantially  different  probability  models  may  be  accepted 

on  the  basis  of  the  test.  Crow  and  Bardwell  (1965)  examined  the  fit  of  a 

collection  of  discrete  probability  models  to  several  famous  data  sets. 

Models  investigated  include  the  Poisson,  Neyman  type  A,  Poisson-binomial, 

hyper-Poisson  and  Charller  type  B series  distributions.  The  data  obtained 

by  Rutherford  and  Geiger  on  the  number  of  alpha  particles  emitted  by  a bar 

of  polonium  were  fit  by  these  distributions,  and  it  was  shown  that  the 

2 

Poisson,  type  B and  hyper-Poisson  all  fit  well,  that  is,  had  x statistics 

with  P-values  around  .20.  We  have  fit  a convoluted  Poisson  distribution 

to  this  data,  the  convolution  of  Poisson  and  Bernoulli  distributions,  and 
2 

obtained  a x statistic  with  a P-value  in  the  same  neighborhood.  The 

2 

utility  of  performing  a x test  might  reasonably  be  questioned  in  the  light 


of  such  ambiguous  results. 
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test  is  not  directional,  that  is,  is  not  designed  with  particular  alterna- 
tives in  mind,  one  would  expect  that  there  are  more  powerful  tests  available. 
For  the  two  families  of  signal  plus  noise  distributions  investigated  in 
this  paper,  we  have  estimated  the  power  of  likelihood  ratio  tests  for  com- 
parison purposes.  As  predicted,  the  likelihood  ratio  procedure  is  superior, 
sometimes  strikingly  so. 

Tests  for  detecting  the  presence  of  signal  plus  noise  distributions 
have  received  attention  in  a recent  paper  by  Sclove  (1977).  In  that  paper, 
tests  are  proposed  for  the  hypothesis  of  a Poisson-Normal  convolution 
against  the  broad  alternative  of  an  infinitely  divisible  distribution.  A 
test  is  also  proposed  for  the  hypothesis  of  normality  against  the  alterna- 
tive of  a Poisson-Normal  convolution.  These  tests  are  based  on  moment 
estimators  of  cumulants  of  the  underlying  distribution,  and  the  power  of 
these  tests  remains  to  be  investigated. 

In  the  next  section,  we  derive  some  results  concerning  the  maximum 
likelihood  estimates  of  parameters  of  the  signal  plus  noise  distributions 
studied  here.  These  estimates  are  later  used  in  computing  the  likelihood 
ratio  statistic.  In  particular,  we  derive  the  maximum  likelihood  estimates 
of  the  parameters  of  the  geometric-Bernoulll  convolution  and  establish  a 
relation  between  the  MLE's  of  the  parameters  of  the  Polsson-Bernoulll  con- 
volution which  facilitates  the  numerical  search  for  the  Ml£'s.  In 
Section  III,  we  describe  the  procedures  used  in  the  generation  of  random 
samples  from  these  distributions,  and  make  remarks  on  the  numerical  methods 
used.  Tables  sumnar icing  our  simulation  study  are  given  in  Section  IV, 


II.  MAXIMUM  LIKELIHOOD  ESTIMATION 

Two  signal  plus  noise  disCributions  are  studied  in  this  paper: 

(1)  The  distribution  of  the  sum  X * Y + Z of  independent  variables  Y and 
Z where  Y has  a geometric  distribution  with  parameter  tt  and  Z is  a 
Bernoulli  variable  with  parameter  p.  The  probability  mass  function  of 
the  variable  X is  given  by 


P(X«x|tt,p) 


x-1 


if  x = 0 
if  x-1, 2,*' 
otherwise. 


(2)  The  distribution  of  the  sum  X - Y + Z of  independent  variables  where 
Y has  a Poisson  distribution  with  parameter  6 and  Z is  Bernoulli  with 
parameter  p.  The  probability  mass  function  of  this  variable  is  given  by 


-8„x- 1 

P(X-x|e,p)  - - — — ; — ((l-p)Ofpx),  x-0, 1,2,  • • • 


Maximum  likelihood  estimation  for  samples  from  either  of  these  distribu- 
tions when  the  Bernoulli  parameter  p is  known  is  a simple  application  of 
work  in  Samanlego  (1976)  and  Samaniego  (1977).  He  address  below  the  problem 
of  maximum  likelihood  estimation  of  the  parameter  pairs  in  these  two  families 
of  distributions.  Prior  to  our  derivations  in  this  regard,  we  examine  the 
question  of  ldentlf lability  of  these  two-parameter  families.  Sclove  and 
Van  Ryzin  (1969)  obtained  moment  estimators  f variety  of  signal  plus 
noise  distributions,  including  the  Pols son -binomial  convolution,  and  established 


the  ldentif lability  of  this  distribution  in  that  paper.  We  therefore 
turn  our  attention  to  the  geometric-Bernoulli  convolution,  which  can  be 
shown  to  be  identifiable  as  follows.  If  X is  distributed  as  the  sum 
of  independent  geometric  and  Bernoulli  variables  Y and  Z,  we  show  that 
the  first  and  second  factorial  moments  of  X are  in  one-to-one  corres- 
pondence with  the  parameter  pair  (tt,p).  This  is  a sufficient  condition 
for  identlf lability.  We  have 


EY(Y-l) 


( 1-tt)  * 


and  thus 


EZ  - p,  EZ(Z-l)  - 0. 


"l  " + P 


m2  = E(X(X-1) ) 

2tT  . , TT  . 

-1^  (P  + I^)* 


m2  " T-tt  ml 


2m1+m2 


P “ “l  ‘ 2^  • 


Thus,  the  pairs  (n,p)  and  (m^,!^)  are  in  one-to-one  correspondence. 
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Suppose  a sample  of  size  n is  taken  from  a geometric-Bernoulli 
convolution.  Let  Xq,x^, . . . ,x^  be  the  k distinct  integers  observed, 
with  frequencies  n^,n^, . . . ,n^.  We  assume  that  Xq  * 0 and  will  treat  the 

cases  nQ  = 0 and  n^  > 0 separately.  All  other  n^  are  assumed  positive 

k 

with  £ n,  = n.  The  likelihood  function  for  this  sample  may  be  written  as 
i-0  1 


n„  k 


L(x,n,p)  = (p(X-0 ))  0 n (p(x-xt))  1 


i=l 


(l-p)n(^l-TT)n°  n((l-P)(l-TT)TTXi  + pd-TT)/1  S * 
i=l 


n-n, 


(l-p)“0(l-TT)n  V ^^(l-p)^)' 


.n-n„ 


where  S - £ n^,  that  is,  S is  the  sum  of  the  n observations.  The 


i-1 


values  of  p and  n for  which  L > 0 differ  for  different  samples.  We 
decompose  the  problem  of  imtImm  likelihood  estimation  into  several  mutually 
exclusive  and  exhaustive  cases. 


Case  1.  S - n - nQ  (that  Is,  all  observations  are  either  zero  or  one), 
(a)  S - n - nQ,  nQ  - n.  In  this  case,  the  likelihood  is  equal  to 
L - <l-p)n(l-Ti)n 
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which  is  positive  for  tt  < 1,  p < 1,  and  is  maximized  over  the  unit  square 
at  tt  « 0,  p « 0. 

(b)  S ■ n - n^,  0 < n^  < n.  Here, 

L = ( 1-P ) °(1-tt)I^(1-p)tH'P^)  ° 

is  positive  for  0 < n < 1 and  - -j—  < p < 1. 

Differentiating  £n  L one  obtains 


d_ 

dTT 


In  L 


_n_  (n-OpKl-p) 
1-TT  + p + TT(l-p) 


(2.1) 


a n (n-nn) (1-tt) 

in  L = - — + ° ■■  - 0. 

dp  1-p  p + n(l-p) 


Solving  (2.2)  for  p,  we  have 


p = 1 - 


n(l-n) 


Substituting  into  (2.1),  we  obtain 


(n-n0>G(iL^ 


1-TT 


n(l-n) 


n , V^V 
1-TT  + n(l-n)  - nQ  + nQn 


n , n0<n-nQ> 
1-TT  + (n-nn)(l-TT) 


nQ~P 
1-TT  * 


(2.2) 


(2.3) 


nil  ' f .. 


Since  (2.3)  has  no  solution,  there  are  no  critical  points  in  this  case. 
Inspecting  the  boundary  of  the  unit  square,  one  easily  obtains  that 
(tt,P)  “ (0,  S/n)  is  the  ML£ . 

(c)  S - n - nQ,  nQ  - 0.  Here, 

L = (l-TT)n(p  + n(l-p))n 

which  is  positive  for  0 < tt  < 1,  - y^  < p < 1.  Differentiating  in  L one 
obtains 


f lnL.-^+  V = 0 

&TT  1-Tt  P + TT(l-p) 


f m L - ; = 0, 

dp  p + TT(l-p) 


which,  by  inspection,  has  no  solution.  Again,  inspecting  the  boundary  of 
the  unit  square,  we  find  the  MI£  to  be  (f),p)  ■ (0,1). 


Case  2.  S > n - n^  (that  is,  at  least  one  observation  exceeds  1). 
(a)  S > n - n^,  n^  - 0.  The  likelihood  is  given  by 

L - (l-TT)nTTS‘n((l-p)TT  + p)n 


which  is  positive  for  0 < tt  < 1 and  - y^  < p < 1.  Now 


f to  L - - + 22!  + . 0 

dTT  1-TT  TT  (l-p)TT  + P 


f to  L - „»<?-">  - 0. 

dp  (t'P>n  + p 


These  equations  have  no  solution.  Inspection  of  the  boundary  of  the  unit 
square  Identifies  (ft,(>)  - (~r~,  1)  as  the  MLE. 

u 


which  is  positive  for  0 < tt  < 1 


Solving  (2.5)  for  p,  we  obtain 


Substituting  (2.6)  in  (2.4),  we  find 


We  show  that  this  pair  (fi,p)  maximises  L,  whether  or  not  (ft,p)  is  in  the 
unit  square,  that  is,  whether  or  not  p > 0.  The  second  partial  derivatives 


evaluated  at  (ft,f>),  can  be  shown  to  be  equal  to 


I - 
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in  L 


dri 


(TT.p) 


nS 


in  L 


dp 


dpdn 


(ft.p) 


2 2 

+ ^f_  + -U^ 

> „ .2  S-nfnn  , .3 

(n-nQ)  0 (n-nQ) 


n (n-nQ) 
S2n_ 


in  L 


.«  *.  n-n_ 

(TT.p)  0 


To  show  that  the  Hessian  is  negative  definite  at  (fi,p),  we  note  that  the 
diagonal  terms  are  negative,  and  the  determinant  of  the  Hessian  is 


3,  x 3 

n (n-n.)  n n 
+ -7-  . x + 


Vn-V  “oCS-O+Hq)  (n-n  )-*  (n-n.)' 


3 3 

n (n-nQ)  n (n-nQ) 


n0(n-n0>  n0(S'ftfn0>  <n-nQ)2 


n3  n (n-nQ) 


nQ  n0(S-nfnQ)  * 


which  is  positive. 

Now  p < 0 if  and  only  if  S > n(n-ng)/ng,  and  in  this  case,  the  solution 
(ft,f>)  of  the  likelihood  equations  given  by  (2.7)  and  (2.8)  is  not  the  maximum 
likelihood  estimate.  Since  there  is  no  critical  point  in  the  unit  square, 

C 

we  check  the  boundary  to  obtain  the  KLE  (ft,p)  “ ^s+n*  * u8e^u* 


coincidence  that  the  inequality 


S > 


n(n-nQ) 


12 


is  equivalent  to 


S 

S+n 


< 


S-n+n. 


The  MLE  in  Case  2 (b)  may  thus  be  written 


(ft.p) 


S-tH-n 
S 


^)»  ®ax(j). 


Snn 

—j)' 

n(n-nQr 


(2.9) 


One  may  easily  check  that  (ft,p)  given  by  (2.9)  is  in  fact  the  MLE  for 
samples  containing  at  least  one  nonzero  observation  (that  is,  for  all  cases 
except  1 (a)),  the  MLE  being  (fl,p)  - (0,0)  otherwise. 

We  turn  our  attention  to  the  problem  of  maximum  likelihood  estimation 
for  samples  from  the  Poisson-Bemoulli  convolution.  We  are  unable  to 
display  a closed  form  solution  for  the  MLE  here,  but  we  prove  that  any 
nontrivial  solution  (i.e.,  such  that  p 4 0,  1)  of  the  likelihood  equations 
satisfies  the  first  moment  condition 

p + 9 ■ S/n, 

that  is,  the  condition  that  results  from  equating  sample  and  population 
first  moments.  We  are  thus  able  to  reduce  our  numerical  search  for  the  MLE 
to  a one-dimensional  problem. 

Let  Xlt...,Xn  be  a random  sample  from  a Poisson-Bernoulli  convolution. 


The  likelihood  function  is  given  by 
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n -0  xi-1 

L(x,e,p)  - n z-f-, — ((l-p)e  + Px  ) 

1-1  xt*  1 


-n0  S-n  n 

V-2 — n ((l-p)e  + px.). 


n x * 
i 


Thus,  the  likelihood  equations  are 


A fa  l - -«  + + <i-p)  l p>i 


(2.10) 


n (x  -0) 

An  L - L — - — « 0. 

dp 


1 (i-p)©  + P\ 


(2.11) 


By  carrying  out  the  indicated  division,  equation  (2.11)  may  be  rewritten  as 


(9  + f (1-"»  ? (i-P)»V^ 


(2.12) 


provided  p 4 0.  Assuming  p 4 0,  1,  we  substitute  in  for  the  sum  in  (2.12) 
an  expression  for  this  sum  obtained  from  (2.10),  yielding 


2 - <»  + f - t2)  - o- 


This  is  equivalent  to 


or 


n 1 .ne-S+n,  _ 

p * 1-p  1 p ’ 


n(l-p)  - n»fS-n  n 
P(l-P)  " 
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which  reduces  to  the  moment  condition 

p + 0 - S/n. 

The  moment  condition  established  for  solutions  (0,p)  of  the  likelihood 
equations  is  used  in  our  numerical  search  for  the  HUE  by  substituting 
6 ■ S/n  - p into  equation  (2.11),  and  searching  for  solutions  of  the 
equation 


n (x, -S/n  + p) 

z — rr - o.  (2.13) 

x (S/n  - p)(l-p)  + pxt 

It  can  be  shown  that  the  point  (0,p)  * (S/n,  0)  is  always  a solution  of  the 
likelihood  equations,  so  that  our  procedure  for  finding  the  ML£  involves 
comparing  the  likelihood  at  this  point  with  the  likelihood  at  any  point 
(0,p)  for  which  p € (0,1]  is  a solution  of  (2.13). 


III.  RANDOM  GENERATION  AND  NUMERICAL  PROCEDURES 

Random  samples  of  size  50,  100  and  200  were  generated  from  the 
geometric-Bernoulli  and  Poisson-Bernoulli  convolutions  using  algorithms 
which  employ  random  variables  uniformly  distributed  on  [0,1].  Let  u 
represent  an  observed  uniform  variable.  Geometric  random  variables  were 
obtained  as  follows: 

1.  Set  k * 0. 

2.  Generate  u. 

3.  If  u > tt,  deliver  k as  G(tt);  otherwise,  set  k - k+1  and 
g>  to  2. 

Bernoulli  random  variables  were  generated  by 
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1 . Generate  u . 

2.  If  u < p,  deliver  k ■ 1 ta  fi(l,p);  otherwise  deliver  k - 0. 

Poisson  random  variables  were  generated  from  a sequence  of  uniform  variables 
as  follows: 

1.  Set  a = 1,  k = 0. 

2.  Generate  u. 

3.  Let  a = a • u. 

4.  If  a < e 6,  deliver  k as  P(8) ; If  a > e ®,  set  k - k+1  and 
go  to  2. 

It  is  easy  to  show  that  the  output  k obtained  by  this  algorithm  has  a 
Poisson  distribution  with  parameter  6.  The  algorithm  is  based  on  a technique 
proposed  by  Ahrens,  Dieter  and  Grube  (1970). 

The  numerical  procedure  used  to  calculate  the  MLR  for  the  parameters  of 
the  Poisson-Bemoulli  convolution  deserves  some  comment.  Initially,  the 
Newton-Raphson  algorithm  was  used,  but  it  was  found  to  be  inadequate. 

Several  data  sets  gave  rise  to  multiple  roots,  and  there  is  no  guarantee  that 
the  Newton-Raphson  procedure  converges  to  the  ML£ . The  fact  that 
(0,p)  « (S/n,  0)  is  always  a solution  of  the  likelihood  equations  seemed  to 
cause  difficulty.  There  were  a number  of  examples  in  which  NR  converged  to 
(S/n,  0)  when  the  MIR  was  elsewhere.  This  occurred  in  an  example  where  the 
sample  consisted  of  zeros  and  ones,  in  which  case  the  MLR  is  (0,p)  ■ (0,  S/n). 
The  method  of  moments  estimate  of  (0,p)  was  used  as  an  Initial  value  for  each 
sample. 

After  reducing  the  problem  to  one  dimension,  we  opted  for  a search 
procedure  which  is  guaranteed  to  find  a root  in  [0,1]  if  one  exists.  Interval 
halving  and  the  method  of  false  positions  were  considered.  Barnett  (1966) 
reconmends  the  method  of  false  positions,  but  makes  incorrect  claims  concerning 
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this  method.  He  claims  incorrectly  (see  p.  159)  that  one  can  guarantee 
that  the  computed  MLE  is  within  an  arbitrarily  small  distance  of  the  true 
MLE . By  partitioning  an  interval  Into  subintervals  of  length  e,  and  then 
applying  FP  to  each  subinterval,  one  may  find  one  root  in  each  of  several 
subintervals.  Even  if  one  of  these  roots  is  within  e of  the  true  MLE 
it  may  happen  that  among  all  the  roots  found,  the  one  at  which  the  likelihood 
is  largest  is  not  within  e of  the  true  MI£.  Interval  halving  suffers  from 
the  same  deficiency,  of  course,  but  was  adopted  because  of  its  logical 
simplicity.  The  interval  (.0001,  .9999)  was  partitioned  into  ten  intervals 
of  equal  length.  The  number  ten  was  selected  after  partitioning  the  interval 
into  90  subintervals  failed  to  produce  more  roots  in  any  of  20  data  sets  on 
which  we  applied  the  procedure.  A subinterval  was  identified  as  containing 
a root  if  the  function  in  (2.13)  took  on  different  signs  when  evaluated  at 
the  endpoints  of  the  interval.  When  the  existence  of  a root  was  determined, 
the  interval  was  halved,  and  the  same  criterion  applied  iteratively  until 
the  root  was  approximated  to  the  desired  level  of  accuracy.  Once  the  roots 
uncovered  by  our  procedure  were  well  approximated,  the  likelihood  was 
evaluated  at  each  root  and  at  the  endpoints  p ■ 0 and  p “ 1.  The  pair 
(9,p)  among  these  which  maximized  the  likelihood  was  dubbed  the  MLB,  and  was 
used  in  our  calculation  of  the  likelihood  ratio  statistic. 

IV.  POWER  ESTIMATION  FOR  x*  AND  LIKELIHOOD  RATIO  TESTS 

Five  hundred  samples  of  sizes  50,  100,  200  were  taken  from  the  two 

signal  plus  noise  distributions  under  study  at  a variety  of  values  of  the 

2 

parameter  pair.  The  approximate  x statistic  appropriate  for  each  test  was 

2 

tabulated  and  compared  to  the  5%  cutoff  of  the  x distribution  with  appropriate 


.-V'T.—- 
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degrees  of  freedom.  In  the  case  of  the  x goodness  of  fit  test,  the  limiting 

null  distribution  of  the  x statistic  when  parameters  are  estimated  by  MLE's 

is  given  by  Chernoff  and  Lehmann  (1954).  It  is  shown  there  that  the  limiting 

2 2 

distribution  is  "between"  x^  j and  where  k is  the  number  of  cells 

and  r is  the  number  of  estimated  parameters.  In  our  simulation,  the  test 

2 

which  uses  the  57.  cutoff  point  of  X^.2  was  use<*  throughout.  This  procedure 

slightly  overestimates  the  power.  For  the  likelihood  ratio  procedure, 

2 

-2  in  X was  assumed  to  have  Xj  as  its  null  distribution,  where  X is  the 
likelihood  ratio  statistic. 

For  each  of  the  parameter  pairs  indicated,  the  proportion  of  rejections 
of  the  hypothesis  H^:  p ■ 0 (postulating  a geometric  or  a Poisson  model  for 
the  data)  in  500  repetitions  was  tabulated.  The  results  are  as  follows: 


TABLE  I:  G(tt)  * B(l,p) 
N - 50 

TT 


.35 .50 .65 .80 


2 

X 

LR 

2 

X 

LR 

2 

X 

LR 

2 

X 

LR 

.00 

.040  j 

.040 

1 

.048  : 

.042 

.044  | 

.046 

1 

.056  • 

i 

.032 

.10 

.052  ! 

.086 

.072  ! 

.066 

.062  j 

.076 

.074  J 

i 

.068 

.20 

.196  | 

.254 

.144  S 

.208 

.094  | 

.140 

.064  • 

i 

.112 

.30 

.366  j 

.472 

.234  ! 
| 

.372 

.198  | 

.320 

.112  ! 

1 

.202 

.40 

.596  | 

.760 

.436  ! 

I 

.610 

.262  | 

.500 

.104  | 

.324 

.50 

.840  | 

.902 

.658  : 
• 

.860 

.418  ) 

.710 

.176  | 

.422 

.60 

.970  ! 

.998 

.818 : 

1 

.942 

.586  ] 

.870 

• 222  ! 

i 

.674 

.70 

.994  j 

.998 

.934  ; 

.988 

.752  | 

.966 

.204  | 

.798 

.80 

.992  ! 

1.00 

.934  | 

1.00 

.772  j 

1.00 

.250  | 

.938 

.90 

.840  j 

1.00 

.742  ! 

1 

1.00 

.588  ) 

1.00 

.298  | 

.994 

1.00 

.578  ; 

1.00 

.578  ! 

1 

1.00 

.484  ; 

1.00 

.332  ; 

1.00 

N - 200 
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.50 


1.0 


» 

1 

2 

X 

LR 

2 

X 

LR 

2 

X 

LR 

2 

X 

LR 

.00 

.044 

.024 

.052 

.038 

1 

.066  ! 

.030 

.046 

.034 

.10 

.064 

.040 

.068 

.044 

.052  ! 
1 

.024 

.048 

.050 

.20 

.072 

.112 

.040 

.062 

.048  ! 

1 

.050 

.042 

.038 

.30 

.128 

.234 

.056 

.132 

.044  i 

.056 

.044 

.054 

.40 

.348 

.588 

.110 

.254 

.050  ! 

1 

.134 

.046 

.086 

.50 

.652 

.868 

.260 

.514 

.052  ! 

1 

.202 

.072 

.108 

.60 

.952 

.998 

.490 

.810 

.120  i 

1 

.350 

.046 

.134 

.70 

1.00 

1.00 

.830 

.976 

.194  ! 

1 

.570 

.072 

.262 

.80 

1.00 

1.00 

.972 

.998 

.372  ! 

.858 

.076 

.372 

.90 

1.00 

1.00 

.996 

1.00 

.538  ! 

1 

.976 

.112 

.602 

1.00 

1.00 

1.00 

.898 

1.00 

.716  • 

i 

1.00 

.178 

.836 

2.0 


4.0 


2 

The  estimates  of  the  power  of  x and  likelihood  ratio  tests  tabulated 
above  lead  us  to  conclude  that  these  tests  have  the  following  general 
characteristics.  For  a fixed  value  of  the  Bernoulli  parameter,  the  power 
of  both  tests  decrease  as  the  geometric  parameter  n or  the  Poisson 
parameter  6 increases.  This  agrees  with  our  intuition  in  that  as  TT  or 
6 increases,  the  spread  of  the  distribution  increases,  and  it  is  correspond- 
ingly more  difficult  to  detect  a Bernoulli  component.  For  fixed  0 or  tt, 

the  power  of  the  likelihood  ratio  test  Increases  with  p.  It  is  interesting 
2 

to  note  that  the  x test  does  not  behave  similarly.  The  power  curve  seems 

to  be  parabolic  in  p for  fixed  tt  or  0,  and  the  power  is  often  seen  to 

be  higher  at  p * .7  or  .8  than  it  is  at  p ■ 1.  Insofar  as  the  case  p ■ 1 

might  be  considered  the  most  radical  departure  from  the  null  hypothesis  among 

2 

the  class  of  alternatives  being  considered,  this  feature  of  the  x test  is 


I 


undesirable 


21 


The  estimated  power  curves  of  the  likelihood  ratio  test  generally 

2 2 

(but  not  uniformly)  dominate  those  of  the  x test.  The  X test  dominates 

only  for  small  values  of  p,  and  this  is  easily  explained  by  the  fact  that 

the  likelihood  ratio  test  is  somewhat  conservative,  achieving  an  actual 

significance  level  in  the  neighborhood  of  .03  compared  to  the  nominal  level 
2 

of  .05,  while  the  x test  is  slightly  anti-conservative.  Thus,  the  power 
2 

curves  for  the  x test  are  generally  above  those  for  the  LR  test  at  p > 0, 

and  the  reversal  takes  place  at  some  p > 0.  The  superiority  of  the  LR  test 

is  occasionally  striking.  If  one  looks  at  the  ratio  of  estimated  power,  one 

finds  many  examples  of  this  ratio  exceeding  3.  For  samples  of  size  50 

generated  from  the  geometric-Bernoulli  convolution  with  TT  « .8  and  p ■ .7, 

we  were  able  to  detect  a Bernoulli  component  for  only  20%  of  the  samples 
2 

using  a x test,  while  with  the  IR  test,  it  was  detected  in  80%  of  the 

samples.  Similar  comparisons  can  be  made  from  our  simulation  of  samples 

from  the  Poisson-Bernoulli  convolution. 

The  class  of  signal  plus  noise  distributions  comprises  a huge  collection 

of  probability  models  which  contain  the  standard  models  as  degenerate  cases. 

There  are  many  sources  in  nature  of  data  which  might  reasonably  be  modeled 

by  a signal  plus  noise  distribution.  The  Geiger  counter  data  set  is  a 

classic  example  — a number  of  other  examples  are  mentioned  in  Samaniego  (1976), 
2 

The  power  of  the  x goodness  of  fit  test  at  signal  plus  noise  alternatives 
tends  to  be  low  --  at  least  by  comparison  with  other  available  tests. 
Approximate  likelihood  ratio  tests,  where  maximum  likelihood  estimates  of 
parameters  of  a signal  plus  noise  distribution  are  obtained  numerically, 
provide  a means  for  checking  the  goodness  of  fit  of  standard  probability 
models  against  a large  class  of  alternatives.  Our  simulation  results  Indicate 


2 

Chat  Che  dividends  from  implementing  such  tests  instead  of  the  x test 
can  be  substantial. 
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